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Abstract 

The purpose of this study is to prepare a multiple-choice achievement test with high reliability and validity for 
the “Let’s Solve the Puzzle of Our Body” unit. For this purpose, a multiple choice achievement test consisting of 
46 items was applied to 178 fifth grade students in total. As a result of the test and material analysis performed 
during the test development process, difficulty, distinctiveness, and item-total correlation coefficients of the 
materials were calculated. For the validity study, a table of specifications was prepared and the Content Validity 
Index (CVI) was found to be 0.95 by taking an expert opinion. As a result of the analysis, 8 items were removed 
from the test and the KR-20 reliability coefficient of the final test consisting of 38 items was calculated as 0.87. 
As a result of the item analyses, while item difficulty indices were valued between 0.30 and 0.74, item 
distinctiveness indeces were valued between 0.31 and 0.71. The average difficulty of the test was calculated as 
moderate (0.56) and its distinctiveness was calculated as very good (0.49). 

Keywords: biology, developing achievement test, item analysis, reliability, science education, validity 

1. Introduction 

In the researches related to the 5 th grade biology subjects covering the transition period from primary school to 
secondary school, it is revealed that the students have deficient level of knowledge and alternative concepts in 
“Nutrients and their characteristics ”, “Digestion of nutrients ” and “ Excretory System in our body ” which are 
included in “Let’s Solve the Puzzle of Our Body” unit (Banet & Niinez, 1997; Carvalho, Silva, & Clement, 2003; 
Giingor, 2009; Giingor & Ozgiir, 2009; Niinez & Banet, 1997; Patrick & Tunnicliffe, 2010). The deficient level 
of knowledge level of the students from their early ages led the researchers to make studies about eliminating the 
lack of information and alternative concepts of the students in these subjects. 

In educational studies, many measurement tools such as interviews, open-ended questions, concept maps, tests 
can be used to determine the level of students’ understanding of knowledge and concepts. Qualitative ones from 
these researches can work with fewer participants, enabling more in-depth research; however, with quantitative 
ones a large audience can be reached with more participants (Griffard, 2001). Multiple choice tests are very 
suitable measurement tools for determining the level of knowledge of different subjects of many students at 
different academic levels (Burton, Sudweeks, Merrill, & Wood, 1991). Multiple-choice tests also enable students 
to determine the misconceptions they have by including inaccuracies they have in the options (Treagust, 1988). 

When the literature is examined, there are many researches carried out with the 6 th and 7 th grade (Akgiindiiz, 
2013; Aydede & Matyar, 2009; Erdogan, 2010; Inel, 2009; Kiras, 2013) or researches carried out with the 
university students and the teachers, in relation to sub topics and concepts in the content of the “Zei ’s Solve the 
Puzzle of Our Body” unit (Qardak, 2005; Patrick & Tunnicliffe, 2010; Prokop & Faneovieova, 2006). However, 
there are very few studies related to Let’s Solve the Puzzle of Our Body unit in 5 th grade (Giingor & Ozgiir, 
2009). As there are only a few researches related to this subject, it has been needed to develop a reliable and 
valid mesurement tool for “Let’s Solve the Puzzle of Our Body” unit since it is one of the essential requirements 
especially for science teaching of the 5 th grade, which is the first step of the secondary school grade. 
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2. Method 

2.1 Sample 

Pilot application of the study was carried out with a total of 178 5 th grade students, 89 female and 89 male 
students studying at two central schools in Samsun city of Turkey. The schools of whom have been randomly 
selected by lot, among the schools belonging to Turkish Ministry of National Education without considering 
their academic success. The distribution of the sample according to the schools and sex is shown in Table 1. 


Table 1. The number of schools and students participated in the pilot study 


Gender 

Secondary School A 

Secondary School B 


Total 

f 

% 

1 

% 

f 

% 

Girl 

43 

48.3 

45 

50.6 

88 

49.4 

Boy 

46 

51.7 

44 

49.4 

90 

50.6 

Total 

89 

100 

89 

100 

178 

100 


2.2 Data Collecting Instrument 

In this research. Let’s Solve the Puzzle of Our Body Unit Achievement Test was used as a collecting data tool. The 
aim of using multiple-choice testing as an achievement test is to allow the ability to measure many sub-concepts of 
the unit taught in the research, to make it easy to evaluate and to enable to measure how much it has been learned 
(Marx et al., 2004). 

The achievement test was prepared by taking into consideration the objectives of the “ Let’s Solve the Puzzle of 
Our Body ” unit which is included in the 5 th grade Ministry of National Education (MoNE) Science Curriculum 
to be used in the research. Regarding the level of readiness of 5 th grade students, 46 multiple choice test items 
with four options were created. While creating the test items, the questions of the achievement test were created 
by the researcher by examining the 5 th grade Science course books prepared by the Ministiy of National 
Education (Erten, 2015; Karaca, 2014), leaf tests related to “ Let’s Solve the Puzzle of Our Body ” unit and the 
exams conducted by the Ministry of National Education for the all secondary school students across the country. 

2.3 General Information about Unit 

In the context of Let’s Solve the Puzzle of Our Body unit there are three subtopics: Nutrients and their 
characteristics, digestion of nutrients and excretory system in our body. In the Ministry of National Education 
Science Curriculum in total 13 objective were given as 36 lesson hours. The sub topics and contents related to the 
unit are shown in Table 2. 


Table 2. Content of the “ Let’s Solve the Puzzle of Our Body ” unit in science curriculum (MoNE, 2015) 


Unit Subjects 

Subject/Concepts 

Lesson 

Hours 

The Number of 

Objectives 

Nutrients and Their Characteristics 

Nutriments, balanced nutrition, harms of smoking and alcohol 

12 

6 

Digestion of Nutrients 

Structures and organs of digestion, transportation of nutrients in 
body, nutrients digestion, tooth and dental health 

12 

4 


Structures and organs of excretory, structures and organs that 



Excretory System in Our Body 

enable removal of effluents and noxious substances out of body, 
types of excretory, kidney health 

12 

3 


2.4 Data Analysis 

In the analysis of the data obtained during the development of the test, for each item, standard deviation, 
arithmetic mean, item distinctiveness, item difficulty, Kolmogorov-Smirnov test for the normality test, biserial 
correlation coefficients in item total score correlation and KR-20 reliability coefficient in reliability calculations 
were used and calculated statistically. 
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3. Findings 

According to (^elik (2000) there are steps to follow when developing an achievement test; planning, item writing, 
item analysis and item selection. After investigating the research on the achievement test development in the 
field (Bakioglu, Karamustafaoglu, S., & Karamustafaoglu, O., 2014; £ahk & Ayas, 2003; Tosun & Ta^kesenligil, 
2011) and examining the related literature, the steps taken during the test development process in this study are 
summarized in Figure 1 briefly. 


Examination of 
achievement test 
development studies in 
the literature 


Examination of course 
books, internet websites, 
and test books in literature 
related with the unit 


Examination of 
objectives, subjects and 
concepts stated in the 
Science Curriculum 


Creating Item Pool 


Talcing an Expert’s Opinion 




Taking the Opinions of 
Science Teachers 


Creating the Pretesting Fonn 


Conducting the Pilot Application 


Taking an Expert’s Opinion 


Creating the Pilot Fonn 


Conducting the Pilot Application 


Conducting the Test Statistics and Item Analysis 

u ;i u;i li li mm llu uuu Lij.i|i|i|yiaaiiaaiiaiiaaiiaaauj y y y y wwmMJt 


Creating the Last Achievement Test 



Figure 1. The process of developing achievement test 
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3.1 The Study of Validity 

While preparing the achievement test for the “Let’s Solve the Puzzle of Our Body ” unit in the research, at least 
three test items related to objective were formed. While creating the items, an expert was consulted in order to 
ensure validity. 

For the content validity of the achievement test two faculty members from Ondokuz Mayis University Science 
Teaching Department, four doctoral students and four science teachers, totally 10 people were consulted. For 
each item found in the pilot achievement test, a graded “ Expert Evaluation Form ” was given to the experts. For 
each item in this form three grades were given: appropriate, must-be-corrected and must-be-excluded. According 
to the opinions obtained from the opinion form, the Content Validity Rates (CVR) were calculated for each item 
(formula 1). CVR is calculated by subtracting one from the division of the number of experts who marked the 
“required ” option to the half of number of total experts (Yurdagiil, 2005). 


formula 1 


NA 

CVR = - - 1 

N/2 


NA: The number of experts who are approving the test items as appropriate. 

N: The total number of experts who states opinions related to test items. 

CVR: Content Validity Rates. 

In Table 3, minimum values of CVR at a=.05 significance level are included for an expert opinion according to 
Veneziano and Hooper (1997). When interpreted according to this table; 10 expert opinions are used in the 
content validity calculations of the achievement test questions used in this study, therefore, to provide 
significance statistically according to expert numbers, for 10 experts 0.62 value was used as the Content Validity 
Criterion (CVC). 


Table 3. Minimum values of CVR according to expert opinion number (Veneziano & Hooper, 1997) 


Number of 

Specialist 

Min. Value 

Number of 

Specialist 

Min. Value 

Number of 

Specialist 

Min. Value 

5 

0.99 

10 

0.62 

15 

0.49 

6 

0.99 

11 

0.59 

16 

0.42 

7 

0.99 

12 

0.56 

17 

0.37 

8 

0.78 

13 

0.54 

18 

0.33 

9 

0.75 

14 

0.51 

19 

0.31 


In the study conducted, all items from the 46-item achievement test were taken into the application form since no 
item had a lower value than 0.62 which is the Content Validity Criterion (CVC) for 10 experts. Afterwards, 
CVRs were collected and the total validity index of the scale was obtained. As a result of the calculations, the 
Content Validity Index (CVI) of the scale was found to be 0.95 and since CVI>CVR, the content validity of the 
scale was found to be significant statistically (Yurdagiil, 2005). 

For the face validity of the achievement test, a faculty member from the Department of Science Education, a 
Science teacher and a language expert were consulted and the necessary corrections were made in the direction 
of incoming feedbacks. According to these feedbacks, some distractors are at a level that students have difficulty 
in understanding and some questions are very long and there are two negations in the same question. As a result 
of the expert examination, no item was eliminated and the pilot application was prepared by making the 
corrections in the direction of suggestions. So as to determine the content validity of the test, the indicator chart 
which consists of the unit objectives has been prepared and each one of the objectives has at least three items. 
The indicator chart related to the content validity has been given Table 4. 

The pilot application of a total 46 multiple choice test with 22 items related to “ Nutrients and their 
Characteristics”, 13 items of “Digestion of Nutrients” and 11 items of “Excretory System in Our Body” included 
in Let’s Solve the Puzzle of Our Body unit was carried out (Table 4). 
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Table 4. The distribution of test items according to subjects and objectives 


Subjects 

Objectives 

Item No 

Number of Item 


It recognizes that nutrient content is essential for the vital activities of 
living things. 

1,2, 3,4, 5, 6* 




It searches and provides information on which nutrients have the most 

vitamins. 

7, 8, 9*, 10* 



Nutrients and 

It deduces that water and minerals are present in all nutrients. 

11*, 12, 13 



Their 

Characteristics 

It searches and presents the effects of balanced nutrition on human 

health. 

14, 15, 16 

22 



It discusses the importance of freshness and naturalness of nutrients for a 
healthy life based on the research data. 

17, 18, 19* 




It discusses the damage of smoking and alcohol to the body based on the 

research data. 

20,21,22 




It demonstrates the position of structures and organs in digestion on the 
model respectively. 

23, 24, 25*, 26, 36* 



Digestion of 

Nutrients 

It explains the types of teeth by showing them on the model. 

It cares for nutrition, cleaning and regular teeth control for dental health. 

27,28,29,31 

30, 32, 33 

13 



It deduces that nutrients are transported by blood in body after digestion. 

34, 35, 37 




It recognizes the structures and organs in excretion. 

36*, 37, 38,43 




It deduces that there are different types of excretion in the body and that 




Excretory System 

harmful substances emerged as result of the excretory activities must be 

39, 40, 41*, 42, 44 

11 


in Our Body 

thrown out of the body. 

It searches and presents what must be considered to protect kidney 

health. 

43,45,46 





Total 


46 


* Items eliminated as result of pilot application. 


3.2 Normality Test 

Kolmogorov-Smirnov which is one of the normality tests was applied to test the suitability of the normal 
distribution of the data obtained from the achievement test. The fact that the p value calculated as the result of 
the analysis is higher than .05 is interpreted as the scores do not show any significant (extreme) deviation from 
the normal distribution at this significance level (Buyiikoztiirk, 2010). Accordingly, the Kolmogorov-Smirnov 
test results show that achievement test scores of the students does not show any significant difference from the 
normal distribution (D(178)=.047; p=,200; p<.05). 

3.3 The Item Difficulty’ and Distinctiveness 

In the rating of the results obtained from the achievement test, the total score of each student was calculated by 
giving 1 point to the correct answers and 0 point to the wrong answers, unanswered questions and to those who 
marked more than one answer for the same question. The test results obtained after rating are ranked from the 
highest to the lowest. Item analysis was performed by creating groups in a way that the first 27% (N=48) of the 
score ranking constitute the upper group and the last 27% constitute the lower group and by using Microsoft 
Excel and SPSS programs for the answers given by the students for each item. 

About the levels of item difficulty, it is considered that if the item difficulty index (pj) is between 0.00-0.19 the 
item is very difficult, if it is between 0.20-0.34 the item is difficult, if it is between 0.35-0.64 the item has medium 
difficulty, if it is between 0.65-0.79 the item is easy and if it is between 0.80-1.00 the item is very easy (Sozbilir, 
2010). In the results of item analysis related to each item in the achievement test, item difficulty index values vary 
from 0.30 to 0.74. 

Item distinctiveness is the comparison of the average of the scores that end groups such as upper and lower groups 
give each item when they are ranked from the highest to the lowest according to the total scores obtained from the 
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scale (Tav^ancil, 2006). As a result of the item analysis related to each item in the achievement test, in choosing to 
decide which item will remain in the test, as item distinctiveness index (rjx) it is considered that if rjx<0.19 the 
item is unacceptable, if it is between 0.20-0.29 the item must be revised, if it is between 0.30-0.39 the item is 
good/acceptable and if 0.40<rjx the item is very good/acceptable (Ozpelik, 2010). In the achievement test 
developed as 46-items, arithmetic mean, standard deviation, variance, reliability, item distinctiveness, item 
difficulty, item correlations calculations were performed. Items whose distinctiveness index is lower than 0.30 (9 th , 
10 th , 11 th , 19 th , 25 th , 36 th , 41 th ) were excluded from the test. However, 7 th item whose distinctiveness index is 0.27 
was not excluded because four items in total were created (7 th , 8 th , 9 th , 10 th ) related to the objective it qualifies. 
Other items (9 th , 10 th ) were eliminated since their distinctiveness were low and 7 th item was not excluded since in 
case of its elimination there would have been only one item (8 th ) related to the relevant objective. The 
distinctiveness of the 6 th item is 0.31. However, there are 6 items in total related to the objective it qualifies (1 th , 2 th , 
3 th , 4*, 5 th , 6 th ). For this reason, the elimination of the 6 th item, which has the lowest distinctiveness among these 
items, was deemed appropriate (Table 5). 

Table 5. The item analysis of pilot test according to upper-lower group correct answer scores 


Item No 

Upper group correct 

answer score 

Lower group correct 

answer score 

Pj 

Difficulty 

Level 

rjx 

Result 

Status 

1 

46 

24 

0.73 

Easy 

0.46 

VG 

V 

2 

45 

17 

0.65 

Easy 

0.58 

VG 

V 

3 

46 

19 

0.68 

Easy 

0.56 

VG 

V 

4 

40 

12 

0.54 

Average 

0.58 

VG 

V 

5 

36 

8 

0.46 

Average 

0.58 

VG 

V 

6 

34 

19 

0.55 

Average 

0.31 

G 

- 

7 

21 

8 

0.30 

Hard 

0.27 

R 

V 

8 

33 

13 

0.48 

Average 

0.42 

VG 

V 

9 

33 

22 

0.57 

Average 

0.23 

R 

- 

10 

24 

15 

0.41 

Average 

0.19 

D 

- 

11 

8 

6 

0.15 

Very Hard 

0.04 

D 

- 

12 

42 

11 

0.55 

Average 

0.65 

VG 

V 

13 

26 

9 

0.36 

Average 

0.35 

G 

V 

14 

43 

18 

0.64 

Average 

0.52 

VG 

V 

15 

39 

17 

0.58 

Average 

0.46 

VG 

V 

16 

38 

17 

0.57 

Average 

0.44 

VG 

V 

17 

43 

25 

0.71 

Easy 

0.38 

G 

V 

18 

44 

20 

0.67 

Easy 

0.50 

VG 

V 

19 

25 

14 

0.41 

Average 

0.23 

R 

- 

20 

26 

4 

0.31 

Hard 

0.46 

VG 

V 

21 

45 

16 

0.64 

Average 

0.60 

VG 

V 

22 

35 

11 

0.48 

Average 

0.50 

VG 

V 

23 

42 

10 

0.54 

Average 

0.67 

VG 

V 

24 

41 

20 

0.64 

Average 

0.44 

VG 

V 

25 

20 

6 

0.27 

Hard 

0.29 

R 

- 

26 

46 

19 

0.68 

Easy 

0.56 

VG 

V 

27 

41 

21 

0.65 

Easy 

0.42 

VG 

V 

28 

42 

11 

0.55 

Average 

0.65 

VG 

V 
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29 

44 

20 

0.67 

Easy 

0.50 

VG 

V 

30 

47 

23 

0.73 

Easy 

0.50 

VG 

V 

31 

35 

10 

0.47 

Average 

0.52 

VG 

V 

32 

25 

9 

0.35 

Average 

0.33 

G 

V 

33 

47 

23 

0.73 

Easy 

0.50 

VG 

V 

34 

38 

8 

0.48 

Average 

0.63 

VG 

V 

35 

43 

13 

0.58 

Average 

0.63 

VG 

V 

36 

18 

8 

0.27 

Hard 

0.21 

R 

- 

37 

36 

9 

0.47 

Average 

0.56 

VG 

V 

38 

34 

13 

0.49 

Average 

0.44 

VG 

V 

39 

30 

13 

0.45 

Average 

0.35 

G 

V 

40 

41 

14 

0.57 

Average 

0.56 

VG 

V 

41 

23 

11 

0.35 

Average 

0.25 

R 

- 

42 

34 

14 

0.50 

Average 

0.42 

VG 

V 

43 

44 

21 

0.68 

Easy 

0.48 

VG 

V 

44 

34 

9 

0.45 

Average 

0.52 

VG 

V 

45 

33 

17 

0.52 

Average 

0.33 

G 

V 

46 

32 

15 

0.49 

Average 

0.35 

G 

V 

Average 



0.52 


0.44 




VG: Very good, G: Good, R: to be Revised, D: to be Discarded. 


3.4 The Item Correlation 

In item total correlation, biserial correlation coefficient was used. Biserial correlation coefficient is used to 
calculate the amount of the relationship between a continuous variable and a variable which is actually 
continuous but was made discontinuous and artificially with two categories (Buyiikoztiirk, (,’okluk, & Koklii, 
2010). In this context, there is a relationship between the score obtained from the sum of the achievement test 
(continuous variable) and the score obtained from each item of the test. Biserial correlation coefficient was 
calculated for each item in the test by giving 1 point to the correct answers and 0 point to the wrong and 
unanswered questions. 

Item total correlation explains the relationship between the total score that respondents receive from the 
assessment instrument and the score they receive from each item. The fact that item total correlation is positive 
and high indicates that scale items show similar behaviour and that internal consistency of the test is high 
(Biiyukbzturk, 2010). If the total score and correlation of any item is low, it indicates that that item scales a 
different feature than the other items. Item total correlation should not be negative and it must be at least 0.20. 
When biserial correlation coefficient of each item included in the achievement test was calculated, the 
correlation coefficient values of the 9 th , 10 th , 11 th , 19 th and 41 th items were found to be below 0.30. These 
question items are items that were eliminated in the calculation of the item difficulty and distinctiveness made 
earlier since their values were low. The fact that the correlation between items is high indicates that items are 
homogeneous and therefore highly reliable (Tav§anctl, 2006). After the item elimination of the achievement test 
was performed, the distribution of the test items according to the subjects and objectives included in the unit is 
stated in Table 6. 
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Table 6. The distribution of test items according to subjects and objectives 

Subjects 

Objectives 

Item No 

Number 

of Item 


It recognizes that nutrient content is essential for the vital activities of living things. 

1,2, 3,4, 5 



It searches and provides information on which nutrients have the most vitamins. 

6,7 


Nutrients and 

Their 

It deduces that water and minerals are present in all nutrients. 

It searches and presents the effects of balanced nutrition on human health. 

8,9 

10, 11, 12 

17 

Characteristics 

It discusses the importance of freshness and naturalness of nutrients for a healthy life 

based on the research data. 

13, 14 



It discusses the damage of smoking and alcohol to the body based on the research 

data. 

15, 16, 17 



It demonstrates the position of structures and organs in digestion on the model 
respectively. 

18, 19, 20 


Digestion of 

Nutrients 

It explains the types of teeth by showing them on the model. 

It cares for nutrition, cleaning and regular teeth control for dental health. 

21,22,23,25 

24, 26, 27 

13 


It deduces that nutrients are transported by blood in body after digestion. 

28,29,30* 



It recognizes the structures and organs in excretion. 

30*, 31,35* 


Excretory 

It deduces that there are different types of excretion in the body and that harmful 



System in Our 

substances emerged as result of the excretory activities must be thrown out of the 

32,33,34,36 

9 

Body 

body. 




It searches and presents what must be considered to protect kidney health. 

35*, 37, 38 




Total 

38 


* Test items that scales more than one objective. 


A total of 38 multiple-choice test items, 17 of which are related to “ Nutrients and their characteristics ”, 13 of 
which are related to “ Digestion of nutrients ” and 9 of which are related to “ Excretory in Our Body ” are included 
in the final achievement test (Table 6). 

The arithmetic means and standard deviation values of the items of the test finalized according to the item 
analysis performed as a result of the pilot application of the pilot achievement test are given in Table 7. As a 
result of the item analysis of the achievement test, 6 th , 9 th , 10 th , 11 th , 19 th , 25 th , 36 th , 41 th items were excluded and 
the difficulty and distinctiveness values stated in Table 7 for each of the other 38 items in the test were received. 
As a result of the item analysis, the distinctiveness of all questions was calculated above 0.30. 


Table 7. The item analyse results of last test 


Item No* 

Item No** 

Upper group correct 

answer score 

Lower group correct 

answer score 

rjx 

Level of 

distinctiveness 

Pj 

Level of 

difficulty 

1 

1 

46 

25 

0.44 

VG 

0.74 

Easy 

2 

2 

45 

18 

0.56 

VG 

0.66 

Easy 

3 

3 

46 

19 

0.56 

VG 

0.68 

Easy 

4 

4 

40 

9 

0.65 

VG 

0.51 

Average 

5 

5 

36 

8 

0.58 

VG 

0.46 

Average 

7 

6 

22 

7 

0.31 

G 

0.30 

Hard 

8 

7 

32 

15 

0.35 

G 

0.49 

Average 

12 

8 

41 

11 

0.63 

VG 

0.54 

Average 

13 

9 

27 

7 

0.42 

VG 

0.35 

Hard 
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14 

10 

40 

20 

0.42 

VG 

0.63 

Average 

15 

11 

40 

17 

0.48 

VG 

0.59 

Average 

16 

12 

38 

16 

0.46 

VG 

0.56 

Average 

17 

13 

44 

26 

0.38 

G 

0.73 

Easy 

18 

14 

44 

20 

0.50 

VG 

0.67 

Easy 

20 

15 

26 

5 

0.44 

VG 

0.32 

Hard 

21 

16 

44 

17 

0.56 

VG 

0.64 

Average 

22 

17 

34 

11 

0.48 

VG 

0.47 

Average 

23 

18 

42 

11 

0.65 

VG 

0.55 

Average 

24 

19 

43 

21 

0.46 

VG 

0.67 

Easy 

26 

20 

46 

21 

0.52 

VG 

0.70 

Easy 

27 

21 

44 

20 

0.50 

VG 

0.67 

Easy 

28 

22 

44 

10 

0.71 

VG 

0.56 

Average 

29 

23 

45 

19 

0.54 

VG 

0.67 

Easy 

30 

24 

46 

23 

0.48 

VG 

0.72 

Easy 

31 

25 

34 

10 

0.50 

VG 

0.46 

Average 

32 

26 

26 

10 

0.33 

G 

0.38 

Average 

33 

27 

47 

24 

0.48 

VG 

0.74 

Easy 

34 

28 

39 

7 

0.67 

VG 

0.48 

Average 

35 

29 

43 

14 

0.60 

VG 

0.59 

Average 

37 

30 

36 

6 

0.63 

VG 

0.44 

Average 

38 

31 

33 

14 

0.40 

VG 

0.49 

Average 

39 

32 

32 

13 

0.40 

VG 

0.47 

Average 

40 

33 

41 

14 

0.56 

VG 

0.57 

Average 

42 

34 

35 

15 

0.42 

VG 

0.52 

Average 

43 

35 

43 

22 

0.44 

VG 

0.68 

Easy 

44 

36 

35 

7 

0.58 

VG 

0.44 

Average 

45 

37 

34 

15 

0.40 

VG 

0.51 

Average 

46 

38 

32 

16 

0.33 

G 

0.50 

Average 





0.49 

VG 

0.56 

Average 


* The item numbers of pilot test. 
** The item numbers of last test 
VG: Very good, G: Good. 


Table 8. Achiement test statistics found as a result of item analysis 


Achievement Test 

Number of Item 

N 

Mean 

Variance 

Std. Deviation 

Average Difficulty 

KR-20 

Pilot Test 

46 

178 

23.83 

66.65 

8.16 

0.52 

0.86 

Last Test 

38 

178 

21.08 

57.81 

7.60 

0.56 

0.87 


Arithmetic mean, standard deviation, variance, difficulty and reliability calculations of 38 items were repeated in 
the final achievement test (Table 8). In the research, the average difficulty of the pilot and final achievement 
tests was found to be moderate. The average difficulty of the achievement tests must be 0.50 so that they can 
serve the feature that is scaled and they can be highly reliable (Kan, 2012). 


262 




jel.ccsenet.org 


Journal of Education and Learning 


Vol. 6, No. 2; 2017 


3.5 The Study of Reliability 

In the reliability calculation of the achievement test, the KR-20 reliability coefficient was calculated. The KR-20 
is suitable for determining the reliability coefficient of tests in which each item in is parallel to each other, which 
has the same mean and variance and which was scored by giving one point to the correct answers for each 
question, and not giving any point to the wrong answers or unanswered questions (Baykul, 2010; Tekin, 2000). 
The reliability coefficient value was calculated as 0.86 as a result of the Kuder Richardson 20 (KR-20) 
calculation of the pilot achievement test whereas, after the elimination of the eight items as result of the item 
analysis KR-20 reliability coefficient was calculated as 0.87 (Table 8). An assessment instrument whose KR-20 
reliability coefficient is 0.70 or higher is acknowledged as reliable (Fraenkel & Wallen, 2006; Oz 9 elik, 2010; 
Saipanish, Hiranyatheb, & Lotrakul, 2015). Therefore, this achievement test is considered as reliable. As a result 
of the item analysis, the achievement test consisting of 38 multiple choice items was finalized and prepared for 
using in the research. 

4. Conclusion and Suggestions 

The assessment and evaluation process is important in terms of assessing the effectiveness of science teaching. 
One of the frequently used assessment instruments in the assessment and evaluation studies is multiple-choice 
achievement tests. Today multiple choice tests are one of the most widely used assessment instalments which 
allows comprehensive assessment of achievement and easy scoring for the practitioner by providing many 
questions in a short period of time (Burton, Sudweeks, Merrill, & Wood, 1991; Bagcan Biiyiikturan & Qiknkgi 
Demirta^h, 2012; Treagust, 1988). Test is an assessment instalment easy to apply and score in the assessment 
and evaluation process since it consist of multiple choice items. For this reason, the aim of this study is to 
develop a reliable and valid assessment instalment which can assess the achievement of students related to the 
fifth grade science course “Let ’s Solve the Puzzle of Our Body ” unit. 

In the process of developing the test, firstly the pilot application of the test and test and item analysis were 
performed. As a result of the item analysis of the achievement test consisting of 46 items in total, the final test 
consisting of 38 items was created by eliminating 8 items. A table of specifications showing the relationship 
between the test items created in terms of content validity and the objectives included in the Ministry of National 
Education Science Curriculum was prepared. In addition, the Content Validity Index (CVI) of the test was 
calculated to be 0.95 by taking expert opinions for each item. As a result of the item analyses carried out during 
the test development process; item difficulties were calculated between 0.30-0.74, item distinctiveness index 
were calculated between 0.31-0.71, and item-total score biserial correlation coefficients were calculated between 
0.30-0.66. While calculating the KR-2 reliability coefficient of the final test, the average difficulty of the test was 
found to be moderate and its average distinctiveness was found to be very good. The results show that the 
achievement test is reliable and valid in terms of evaluating the academic achievements of the fifth grade 
students related to the “Let’s Solve the Puzzle of Our Body ” unit. 

When the literature on digestive system, excretory system, nutrients and nutrient types are examined, it is seen 
that college students carried out studies on 6 th and 7 th grade students at secondary school (Alkan Dilbaz, 2013; 
Giigliier, 2012; Giingor & Ozgiir, 2009; Patrick & Tunnicliffe, 2010; Prokop & Faneovieova, 2006; Yildinm, 
2012). This test will enable Piaget to identify the deficiencies in knowledge in the biology field during the 
transitional period of the 5 th grade students transitioning from the concrete process period to the abstract process 
period. 

It is believed that this assessment instrument can help to determine the readiness level of the 5 th grade students 
and their lack of knowledge in subtopics and that it can help the scientific studies of the researchers conducting 
experimental research. In the direction of the results obtained from this research, the following suggestions have 
been made: 

- With this developed achievement test, level and lack of knowledge of the students in “ Nutrients and Their 
Characteristics ”, “ Digestion of Nutrients' ’ and “ Excretory System in Our Body ” subtopics included in “ Let’s 
Solve the Puzzle of Our Body” unit can be determined in the transition period of the students to secondary 
school. 

- The developed achievement test can help students to organize their learning activities according to their 
determined deficiencies by determining their readiness and deficiencies in terms of the 5 th grade biology 
subjects. 

- With the developed achievement test, it is possible to determine the misconceptions in the students by 
examining their level of knowledge and deficiencies as well as the questions they answered wrong. Because 
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the distracters of each item in the test were prepared according to the misconceptions that students have in 
relation to the topic. 

- The developed achievement test can be used as a data collection tool for other researches to be carried out 
in the field of science education. 
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Appendix A 

The Academic Achievement Test of Let’s Solve the Our Body Puzzle Unit 


Question 1) 

Which of these nutrition below are the most fuel nutrients 
in comparison to the others? 

A. Rice-Sugar 

B. Cracked wheat-Orange 

C. Meat-Egg 

D. Pasta-Spinach 

Question 2) 

“Selin had cheese, bread and honey on breakfast; pasta on 
her lunch; French fries and rice on dinner 

Regarding the nutriment that Selin ate for a day, try to find 
out what kind of food Selin takes in her body excessively ? 

A. Carbohydrate 

B. Protein 

C. Fat 

D. Vitamin 

Question 3) 

Which nutriment group is most important as regulator in 
our body? 

A. Protein 

B. Fat 

C. Carbohydrate 

D. Water 

Question 4) 

What kinds of nutrient do we get from the energy that our 
body needs primarily to think, talk, walk, play sports, and 
so on? 

A. Protein 

B. Vitamin 

C. Carbohydrate 

D. Fat 


Question 5) 

Think about the nutrient groups that found excessively on 
animal nutrient such as meat, milk, egg, fish and cheese. 

Which of these below is not one of the primary duties of the 
nutrient group you think about? 

A. It provides growth and development 

B. It has an important role in the development of 
intelligence 

C. It has a serious role in the defence against germs 

D. They provide the energy that body 

Question 6) 

“ Citrus, strawberry, tomato, parsley, cabbage, rosehip” 

Think about the type of the vitamin found in these kinds of 
nutrients above. Which one of the following disorders may 
come in the lack of this type of vitamin? 

A. Increase in teeth and gum problems 

B. Haemophilia 

C. Anaemia, fatigue, scars on the skin 

D. Liver, cardiovascular diseases 

Question 7) 

If you think about the types of vitamins contained in the 
following food groups, which of the given nutrient groups 
contains the Vitamin K morel 

A. Citrus, tomato, strawberry 

B. Red meat, green vegetables, banana and peach 

C. Carrot, wheat, legume family and peanut 

D. Fish, dairy products 

Question 8) 

Which of the following is the group of nutrient that need to 
be taken every day for a healthy body and that are found in 
all nutrients as regulatory? 

A. Protein-Minerals 

B. Water-Minerals 

C. Carbohydrate-Water 

D. Protein-Water 
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Question 9) 

I. Used for energy 

II. They are not synthesized in our bodies but taken 
from the outside 

III. They exist in all the food we eat 

IV. They work as constructive and reparative. 

Which of the information above about water and minerals 
are true? 

A. I.-III.- IV. 

B. II.-IV. 

C. II.-III. 

D. III.-IV. 

Question 10) 

The doctor asks Betiil, who is sick, what she eats. 

Betiil: 

often eat hamburger and drink coke .” 

Doctor: 

-“If you keep eating like that, you are going to get a 
skin breakdown and feel fatigue ” 

What is the doctor trying to say Betiil basically ? 

A. She should have a vitamin-based diet instead of 
carbohydrate 

B. She should have a protein-based diet 

C. She consumes too much vitamin 

D. To eat one type of food is unhealthy. 

Question 11) 

Which of these below does not belong to a person who eats 
properly? 

A. He consumes less food and loses weight 

B. Resistant to diseases 

C. Body cells and tissues renew themselves 

D. Heart-attack risk is unlikely 


Question 12) 

“Cenk has an every-day diet based on carbohydrate .” 

If he keeps eating like that, which one is the most likely to 
happen to his body? 

A. Excess carbohydrate improves his body muscles 

B. Excess carbohydrate is stored as vitamin 

C. Excess carbohydrate turns into fat and make him gain 
weight 

D. As excess carbohydrate is stored, body lacks energy 

Question 13) 

When Kayra goes to the supermarket for shopping which of 
the following is an inappropriate behaviour for a healthy 
diet? 

A. Checking the TSI (Turkish Standard Institution) logo on 
the product 

B. Choosing the products with additives 

C. Preferring natural food to frozen food 

D. Checking the best before dates 

Question 14) 

Which of the things below is not suitable for a balanced 
nutrition? 

A. We should only eat fruits and vegetables. 

B. We should drink plenty of water 

C. We should consume different nutrients on every meal 

We should consume nutrients in proper amount according to 
the age and physical activity 

Question 15) 

Which of the things below is not one of the harms of 
smoking? 

A. It causes lung and laryngeal cancer 

B. Cardiovascular diseases 

C. Causes difficulty in talking and slow reflexes 

D. Causes to skin breakdown 
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Question 16) 

Which of the statements below is not one of the harms of 
alcohol? 

A. It weakens the will be negatively affecting the nervous 
system 

B. It affects the brain, muscles and veins adversely 

C. Makes you sleepy and brings an order to sleeping 
pattern 

D. It makes it hard to control the behaviours and senses 

Question 17) 

Which of the following is the body and structure that 
alcohol affects the most, negatively? 

A. Lung-Circulatory 

B. Heart- Vein 

C. Liver-Nerve 

D. Kidney-Urinary 

Question 18) 

Which one of the following is the route of food in the 
digestive system after stomach? 

A. Kidney-Small intestine-Large intestine 

B. Small intestine-Large intestine 

C. Kidney-Small intestine-Large intestine-urinaiy 
incontinence 

D. Small intestine-Large intestine-Urinary incontinence 

Question 19) 

Which of the following are physically disintegrating foods 
in the digestive system in the human body and the 
remaining waste after digestion is thrown out? 

A. Pharynx-Stomach 

B. Stomach-Large intestine 

C. Pharynx-Large intestine 

D. Mouth-Anus 


Question 20) 

I. II. 



Which of the organs given above are the organs in charge of 
digestion? 

A. I-II 

B. II-III 

C. I-III 

D. I-IV 
Question 21) 

Which of the following is the largest number of teeth in an 
adult individual who can crush and grind food? 

A. Molar tooth 

B. Incisor tooth 

C. Wisdom tooth 

D. Dog tooth 

Question 22) 

Koray used the dough to make a tooth model for the 
assignment given by the teacher. For this, 4 blue, 8 yellow 
and 16 red coloured game hurries are used. 

Which teeth do the colours in Koray’s tooth model 
represent? 

Blue Yellow Red 


A. 

Incisor 

Dog 

Molar 

B. 

Molar 

Dog 

Incisor 

C. 

Dog 

Incisor 

Molar 

D. 

Dog 

Molar 

Incisor 
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Which kinds of teeth below are shown with arrows in the 
mouth model above? 


In the tooth model given above, which of the following duties of 
the dental types below indicated by numbers are correctly 
given? 


A. 

I. 

II. 

III. 

B. 

II. 

III. 

I. 

C. 

III. 

II. 

I. 

D. 

II. 

I. 

III. 


I II III 


A. Dog tooth Molar tooth 

B. Incisor tooth Dog tooth 

C. Dog tooth Incisor tooth 

D. Incisor toot Molar tooth 


Incisor tooth 
Molar tooth 
Molar tooth 
Dog tooth 


Question 24) 



Consuming too hot and cold 
foods not affects on dental 
health, but the waste of food in 
mouth affects on dental health 



Question 26) 



Which of the following statements about decayed teeth is 

wrong? 

A. Decayed tooth damages the heart. 

B. Germs on decaying tooth cause diseases in internal organs. 

C. Newly decaying teeth should be removed. 

D. Decayed tooth causes foul breath in mouth 


Which of the above-mentioned statements of Elif can be 
shown as an example of correct behaviour to protect 
mouth and dental health? 


A. Berna 

B. Faruk 

C. Elif 

D. All of them 
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Question 27) 

Which of the following statements the doctor advises to 
Berk who has toothache is not correct? 

A. You should consume less milk and dairy products 

B. You should use fluoride toothpaste 

C. You should consume fresh fruit 

D. Brush your teeth three times a day 

Question 28) 

Which of the following is the part of the body by which 
food that is digested in our bodies, water, vitamins and 
minerals absorbed into the circulation system? 

A. Kidney 

B. Small intestine 

C. Stomach 

D. Gullet 

Question 29) 

After the digested food becomes shattered and absorbed, in 
which way are the beneficial parts carried on the body? 

A. Passes to stomach to get reabsorbed 

B. It spreads throughout the body through the large 
intestine. 

C. It is transported through the liver to the body 

D. It spreads to the whole body with blood circulation 

Question 30) 


Which one of the following is the system which helps the 
nutrients mix with blood and the organ where water and 
mineral are absorbed? 

A. Circulation - Small intestine 

B. Urinary - large intestine 

C. Urinary - Kidney 

D. Digestion - Large intestine 


Question 31) 

Which of the following is the ureter’s duty? 

A. The short pipe that the urine is thrown out 

B. Place where the blood is filtered 

C. The place where the urine is collected 

D. The conduit carrying the urine from the kidneys to the 
urine 

Question 32) 


Which of the following does not play a major role in 
disposing waste and residual substances in the body? 

A. Sweating 

B. Digestion 

C. Breathing 

D. Urine formation 

Question 33) 

Which of the following is not an organ that helps to remove 
waste from your body? 

A. Stomach 

B. Lung 

C. Skin 

D. Kidney 

Question 34) 

“Urea - Oxygen - Sweat - Carbon Dioxide - Urine’’'’ 

How many of the above are waste materials that are formed 
in the human body? 

A. 1 B. 2 C. 3 D. 4 

Question 35) 


Which of the following should not be done for the health of 
the drainage system? 

A. We should drink plenty of water. 

B. We must wash our hands with soap after toilet. 

C. We have to bathe frequently. 

D. When we have toilet, we should keep our urine. 
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Question 36) 


Question 37) 


I. II. 



“The doctor has some suggestions for Merve’s father who 
suffers from kidney failure and he says he should pay attention 
to his life ." 

Which one of the following is harmful? 

A. Eat his meal too much salty 

B. Avoiding cold and especially getting cold feet 

C. Drink plenty of water 

D. Avoid doing sports 


III. IV. 



Which of the above organs and structures are urinary system 
organs? 


Question 38) 


Kaan will prepare a poster to protect the kidneys’ health. 
Which of the following cannot be one of the items that 
should be mentioned on the poster? 

A. Treatment of tooth decay 

B. Eat bitter and spicy foods 

C. Clean water and cleaned food 

D. Cold protection of urinary tracts 


A. III-IV 

B. I-II-IV 

C. II-III 

D. I-II-III 
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